RECOMBINANT α-N-ACETYLGALACTOSAMINIDASE
ENZYME AND cDNA ENCODING SAID ENZYME

Statement of Government Interest 
This invention was made with government support under
NMRDC Grant Number N0014-90-J-1638. As such, the government
has certain rights in the invention.

CROSS REFERENCE TO RELATED APPLICATIONS 
This Application is a Continuation-In-Part of
Application Serial No. 07/964,756, filed October 22, 1992,
entitled PREPARATION OF ENZYME FOR CONVERSION OF SUB-TYPE A AND
AB ERYTHROCYTES.

FIELD OF THE INVENTION 
This invention relates to a recombinant enzyme for use
in the removal of type A antigens from the surface of cells in
blood products, thereby converting certain sub-type A blood
products to type O blood products and certain sub-type AB blood
products to type B blood products. This invention further
relates to methods of cloning and expressing said recombinant
enzyme. More particularly, this invention is directed to a
recombinant chicken liver α-N-acetylgalactosaminidase enzyme,
methods of cloning and expressing said recombinant
α-N-acetylgalactosaminidase enzyme, and a method of removing
type A antigens from the surface of cells in type A and AB
blood products using said recombinant
α-N-acetylgalactosaminidase enzyme by contacting said enzyme
with blood products so as to remove the terminal moiety of the
A-antigenic determinant from the surface of cells (for example,
erythrocytes) in said blood products, while allowing the
structure and function of the cells in the blood products to
remain intact. The recombinant α-N-acetylgalactosaminidase
enzyme of this invention provides a readily available and
cost-efficient enzyme which can be used in the removal of type
A antigens from the surface of cells in type A and AB blood
products. Treatment of certain sub-type A blood products with
the recombinant enzyme of this invention provides a source of
cells free of the A antigen, which blood products are thereby
rendered useful in transfusion therapy in the same manner as
type O blood products.

BACKGROUND OF THE INVENTION 
As used herein, the term "blood products" includes
whole blood and cellular components derived from blood,
including erythrocytes (red blood cells) and platelets.

There are more than thirty blood group (or type)
systems, one of the most important of which is the ABO system.
This system is based on the presence or absence of antigens A
and/or B. These antigens are found on the surface of
erythrocytes and on the surface of all endothelial and most
epithelial cells as well. The major blood product used for
transfusion is erythrocytes, which are red blood cells
containing hemoglobin, the principal function of which is the
transport of oxygen. Blood of group A contains antigen A on
its erythrocytes. Similarly, blood of group B contains antigen
B on its erythrocytes. Blood of group AB contains both
antigens, and blood of group O contains neither antigen.

The blood group structures are glycoproteins or
glycolipids and considerable work has been done to identify the
specific structures making up the A and B determinants or
antigens. It has been found that the blood group specificity
is determined by the nature and linkage of monosaccharides at
the ends of the carbohydrate chains. The carbohydrate chains
are attached to a peptide or lipid backbone which is embedded
in the lipid bi-layer of the membrane of the cells. The most
important (immuno-dominant or immuno-determinant) sugar has
been found to be N-acetylgalactosamine for the type A antigen
and galactose for the type B antigen.

There are three recognised major sub-types of blood
type A. These sub-types are known as A1, A intermediate
(Aint) and A2. There are both quantitative and qualitative
differences which distinguish these three sub-types.
Quantitatively, A1 erythrocytes have more antigenic A sites,
i.e., terminal N-acetylgalactosamine residues, than Aint
erythrocytes which in turn have more antigenic A sites than
A2 erythrocytes. Qualitatively, the transferase enzymes
responsible for the formation of A antigens differ
biochemically from each other in A1, Aint and A2
individuals. Some A antigens found in A1 cells contain dual
A antigenic sites.

Blood of group A contains antibodies to antigen B.
Conversely, blood of group B contains antibodies to antigen A.
Blood of group AB has neither antibody, and blood group O has
both. A person whose blood contains either (or both) of the
anti-A or anti-B antibodies cannot receive a transfusion of
blood containing the corresponding incompatible antigen(s). If
a person receives a transfusion of blood of an incompatible
group, the blood transfusion recipient's antibodies coat the
red blood cells of the transfused incompatible group and cause
the transfused red blood cells to agglutinate, or stick
together. Transfusion reactions and/or hemolysis (the
destruction of red blood cells) may result therefrom.

In order to avoid red blood cell agglutination,
transfusion reactions and hemolysis, transfusion blood type is
cross-matched against the blood type of the transfusion
recipient. For example, a blood type A recipient can be safely
transfused with type A blood which contains compatible
antigens. Because type O blood contains no A or B antigens, it
can be transfused into any recipient with any blood type, i.e.,
recipients with blood types A, B, AB or O. Thus, type O blood
is considered "universal", and may be used for all
transfusions. Hence, it is desirable for blood banks to
maintain large quantities of type O blood. However, there is a
paucity of blood type O donors. Therefore, it is useful to
convert types A, B and AB blood to type O blood in order to
maintain large quantities of universal blood products.
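
The compatibility rule described above can be sketched in a few lines of Python. This is only an illustration of the ABO logic for red cells as stated in this section; the names `ANTIGENS`, `red_cells_compatible` and `after_a_antigen_removal` are hypothetical, and the sketch ignores sub-types and all other blood group systems.

```python
# Antigens carried on the erythrocytes of each ABO group, as described above.
ANTIGENS = {"O": set(), "A": {"A"}, "B": {"B"}, "AB": {"A", "B"}}


def red_cells_compatible(donor: str, recipient: str) -> bool:
    """A recipient has antibodies against every ABO antigen it lacks, so the
    donor cells may carry only antigens the recipient also carries."""
    return ANTIGENS[donor] <= ANTIGENS[recipient]


def after_a_antigen_removal(group: str) -> str:
    """ABO group that results when the A antigen is removed from the cells."""
    remaining = ANTIGENS[group] - {"A"}
    return next(name for name, ags in ANTIGENS.items() if ags == remaining)


for donor in ANTIGENS:
    ok = [r for r in ANTIGENS if red_cells_compatible(donor, r)]
    print(f"type {donor} cells -> recipients {ok}")
print("A ->", after_a_antigen_removal("A"), "| AB ->", after_a_antigen_removal("AB"))
```

Under these assumptions, removing the A antigen maps type A cells to type O and type AB cells to type B, which is the conversion this specification is concerned with.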

In an attempt to increase the supply of type O blood,
methods have been developed for converting certain type A, B
and AB blood to type O blood. For example, U.S. Patent No.
4,609,627 entitled "Enzymatic Conversion of Certain Sub-Type A
and AB Erythrocytes" ("the '627 Patent"), which is incorporated
herein by reference, is directed to a process for converting
Aint and A2 (including A2B erythrocytes) to erythrocytes
of the H antigen type, as well as to compositions of type B
erythrocytes which lack A antigens, which compositions, prior
to treatment, contained both A and B antigens on the surface of
said erythrocytes. The process for converting Aint and A2
erythrocytes to erythrocytes of the H antigen type which is
described in the '627 Patent includes the steps of
equilibrating certain sub-type A or AB erythrocytes, contacting
the equilibrated erythrocytes with purified chicken liver
α-N-acetylgalactosaminidase enzyme for a period sufficient to
convert the A antigen to the H antigen, removing the enzyme
from the erythrocytes and re-equilibrating the erythrocytes.
As described in the '627 Patent, α-N-acetylgalactosaminidase
obtained from an avian liver (specifically, chicken liver)
source was found to have superior activity in respect of
enzymatic conversion or cleavage of A antigenic sites.

Prior to the present invention, it was necessary to
purify the enzyme from an avian liver source, a process which
is time consuming and can be expensive. Hence, a need has
arisen to develop an enzyme source which is more readily
available. In addition, a need has arisen to develop an enzyme
useful in blood product conversion which enzyme is
cost-efficient.

A simplified purification process is described in a
related application, Serial No. 07/964,756, filed October 22,
1992, entitled "Preparation of Enzyme for Conversion of
Sub-Type A and AB Erythrocytes". This process, as described in
the related application, utilizes chicken liver as a source of
enzyme and, therefore, requires a number of purification
steps. Despite this simplified process, it is still desirable
to provide a more readily available and controlled source of
enzyme, that being cloned and expressed enzyme. This would
provide an enzyme source which is more consistent and which is
readily purified at less cost and expense, with a still further
reduced number of purification steps. Additionally, a
recombinant, cloned enzyme allows for specific protein sequence
modifications, which can be introduced to generate an enzyme
with optimized specific activity, substrate specificity and pH
range.

α-N-acetylgalactosaminidase enzymes are
characterized (and thereby named) by their ability to cleave
N-acetylgalactosamine sugar groups. In isolating or
identifying these enzymes, their activity is assessed in the
laboratory by evaluating cleavage of synthetic substrates which
mimic the sugar groups cleaved by the enzymes, with
p-nitrophenylglycopyranoside derivatives of the target sugar
groups being commonly used. Although very useful in enzyme
identification and isolation procedures (the quantitative
cleavage of these synthetic substrates can be used to readily
distinguish (and thereby identify) enzymes isolated from
different sources), these synthetic substrates are simple
structurally and small-sized and mimic only a portion of the
natural glycoproteins and glycolipid structures which are of
primary concern, those being the A antigens on the surface of
cells.

A natural glycolipid substrate, originally isolated
from sheep erythrocytes, is the Forsmann antigen
(globopentaglycosylceramide). The Forsmann antigen substrate
appropriately mimics the natural A antigen glycolipid
structures and is therefore utilized to predict the activity of
α-N-acetylgalactosaminidase enzymes against the A antigen
substrate. Isolated Forsmann antigen glycolipids have been
shown to inhibit hemolysis of sheep red cells by immune rabbit
anti-A serum in the presence of serum complement.

α-N-acetylgalactosaminidase enzyme has been
isolated from a number of sources besides chicken liver
(described above), including bacteria, mollusks, earthworms,
and human liver. The human α-N-acetylgalactosaminidase
enzyme has been purified, sequenced, cloned and expressed. For
example, in "Human α-N-Acetylgalactosaminidase - Molecular
Cloning, Nucleotide Sequence and Expression of a Full-length
cDNA", by Wang et al., in The Journal of Biological Chemistry,
Vol. 265, No. 35, pages 21859-21866 (December 15, 1990), the
cDNA encoding human α-N-acetylgalactosaminidase was
sequenced. In addition, in "Molecular Cloning of a Full-Length
cDNA for Human α-N-Acetylgalactosaminidase (α-Galactosidase
B)", by Tsuji et al., in Biochemical and Biophysical Research
Communications, Vol. 163, No. 3, pages 1498-1504 (September 29,
1989), the cDNA encoding human α-N-acetylgalactosaminidase
was sequenced. Both the nucleotide sequence and the amino acid
sequence of human α-N-acetylgalactosaminidase are published
therein. Further, PCT Application No. WO 92/07936 discloses
the cloning and expression of the cDNA which encodes human
α-N-acetylgalactosaminidase.

Although human α-N-acetylgalactosaminidase has been
purified, sequenced, cloned and expressed, it is not
appropriate for use in removing A antigens from the surface of
cells in blood products. In determining whether an enzyme is
appropriate for use in removing A antigens from the surface of
cells, one must consider the following enzyme characteristics,
particularly with respect to the Forsmann antigen substrate:
substrate specificity, specific activity or velocity of the
substrate cleavage reaction, and pH optimum. Substrate
specificity is measured in the Km value, which measures the
binding constant or affinity of an enzyme for a particular
substrate. The lower a Km value, the more tightly an enzyme
binds its substrate. The velocity of an enzyme cleavage
reaction is measured in the Vmax, the reaction rate at a
saturating concentration of substrate. A higher Vmax indicates
a faster cleavage rate. The ratio of these two parameters,
Vmax/Km, is a measure of the overall efficiency of an enzyme in
reacting with (cleaving) a given substrate. A higher Vmax/Km
indicates greater enzyme efficiency. For successful and
clinically applicable removal of A antigens from the surface of
cells, the enzyme must be sufficiently active at or above a pH
at which the cells being treated can be maintained. The
procedure described in the '627 patent calls for treatment of
cells at or above a pH of 5.6. Therefore, the pH optimum of an
appropriate enzyme must still provide reasonable enzyme
activity at this pH. These specific characteristics (Vmax/Km,
Vmax, Km and pH optimum) are reported for the human
α-N-acetylgalactosaminidase enzyme in "Studies on Human
Liver α-galactosidases", by Dean et al. in The Journal of
Biological Chemistry, Vol. 254, No. 20, pages 10001-10005
(1979).

The Vmax/Km value for the Forsmann antigen of human
α-N-acetylgalactosaminidase is 0.46, as compared to a Vmax/Km
value of 5.0 for the chicken liver enzyme, indicating an
approximately ten-fold difference in efficiency. The Km is
lower and the Vmax is higher for the chicken liver enzyme,
compared to the human enzyme. Further, human
α-N-acetylgalactosaminidase has a pH optimum for the Forsmann
antigen of 3.9, compared to 4.7 for chicken liver
α-N-acetylgalactosaminidase. By all of these enzyme
characteristics, human α-N-acetylgalactosaminidase enzyme is
not suitable for removal of A antigens, particularly when
compared to the chicken liver enzyme.
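
The comparison above can be checked with simple arithmetic. The following Python sketch uses only the values quoted in this section; the variable names are illustrative, and because the text does not state units for Vmax/Km, only the ratio between the two enzymes is computed.

```python
# Values quoted in the text for the Forsmann antigen substrate.
# Units for Vmax/Km are not given in the source, so only ratios are used.
efficiency = {"human": 0.46, "chicken liver": 5.0}   # Vmax/Km
ph_optimum = {"human": 3.9, "chicken liver": 4.7}
TREATMENT_PH = 5.6  # lowest treatment pH cited for the '627 procedure

# How many times more efficient the chicken liver enzyme is.
fold = efficiency["chicken liver"] / efficiency["human"]

# Distance between each enzyme's pH optimum and the treatment pH.
ph_gap = {src: round(TREATMENT_PH - opt, 1) for src, opt in ph_optimum.items()}

print(f"efficiency ratio (chicken liver / human): {fold:.1f}")
for src, gap in ph_gap.items():
    print(f"{src}: pH optimum lies {gap} units below the treatment pH")
```

The ratio comes out just under eleven, consistent with the "approximately ten-fold" figure, and the chicken liver optimum sits closer to the treatment pH than the human optimum does.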

As a result, a need still existed to develop an enzyme 
which is capable of removing A antigens from the surface of 
cells in blood products, wherein said enzyme is readily 
available and cost-efficient. 

It is therefore an object of this invention to provide
a recombinant enzyme for use in the removal of A antigens from
the surface of cells in blood products.

It is another object of this invention to provide a
recombinant enzyme for use in the removal of A antigens from
the surface of cells in blood products wherein said enzyme is
readily available and may be manufactured on a cost-efficient
basis.

It is a further object of this invention to provide
methods of cloning and expressing a recombinant enzyme useful
in the removal of A antigens from the surface of cells in blood
products.

It is yet another object of this invention to provide
a method of removing A antigens from the surface of cells in
blood products using a recombinant enzyme.

BRIEF DESCRIPTION OF THE DRAWINGS

The above brief description, as well as further
objects and features of the present invention, will be more
fully understood by reference to the following detailed
description of the presently preferred, albeit illustrative,
embodiment of the present invention when taken in conjunction
with the accompanying drawing wherein:

Figure 1 represents a diagram of the strategy used to
clone and sequence the chicken liver
α-N-acetylgalactosaminidase cDNA;

Figure 2 represents the nucleic acid sequence and the
deduced amino acid sequence of the chicken liver
α-N-acetylgalactosaminidase cDNA clone;

Figure 3 represents the expression of chicken liver
α-N-acetylgalactosaminidase in bacteria and rabbit
reticulocyte lysate as shown by Western blot;

Figure 4 represents a homology comparison between
α-N-acetylgalactosaminidases and α-galactosidases; and

Figure 5 represents the expression of chicken liver
α-N-acetylgalactosaminidase in yeast as shown by Western blot.

SUMMARY OF THE INVENTION

This invention is directed to a recombinant chicken
liver α-N-acetylgalactosaminidase enzyme, which enzyme has a
molecular weight of about 45 kDa, is immunoreactive with an
antibody specific for chicken liver
α-N-acetylgalactosaminidase, and also has about 80% amino
acid sequence homology with human α-N-acetylgalactosaminidase
enzyme. The recombinant chicken liver
α-N-acetylgalactosaminidase enzyme of this invention has the
amino acid sequence depicted in Figure 2, from amino acid
number 1 to amino acid number 406. This invention is further
directed to methods of cloning and expressing the recombinant
chicken liver α-N-acetylgalactosaminidase enzyme, and to a
method of using said enzyme to remove A antigens from the
surface of cells in blood products so as to convert said blood
products of certain A sub-types to type O, thereby rendering
said blood products universal for use in transfusion therapy.
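
As a rough consistency check between the 406-residue length and the molecular weight of about 45 kDa stated above, the polypeptide mass can be estimated in Python. The average residue mass of 110 Da is a common rule of thumb and is an assumption here, not a figure from this specification; glycosylation is ignored.

```python
# Rule-of-thumb average mass of one amino acid residue (assumption, in daltons).
AVERAGE_RESIDUE_MASS_DA = 110
# Length of the deduced amino acid sequence stated in the text.
N_RESIDUES = 406

estimated_kda = N_RESIDUES * AVERAGE_RESIDUE_MASS_DA / 1000
print(f"estimated polypeptide mass: {estimated_kda:.1f} kDa")
```

The estimate lands within about 1 kDa of the stated value.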

DETAILED DESCRIPTION OF THE INVENTION 
This invention is directed to a recombinant enzyme for
use in the removal of type A antigens from the surface of cells
in blood products, thereby converting certain sub-type A blood
products to type O blood products and certain sub-type AB blood
products to type B blood products. The recombinant chicken
liver α-N-acetylgalactosaminidase enzyme of this invention
has a molecular weight of about 45 kDa and is immunoreactive
with an antibody specific for chicken liver
α-N-acetylgalactosaminidase. In addition, the recombinant enzyme
of this invention has about 80% amino acid sequence homology
with human α-N-acetylgalactosaminidase enzyme. The
recombinant chicken liver α-N-acetylgalactosaminidase enzyme
of this invention has the following nucleic acid and deduced
amino acid sequence:



SEQ ID NO: 1

ATG CTG GAG AAC GGG CTG GCG CGG ACC CCG CCC ATG GGC TGG TTG GCC
Met Leu Glu Asn Gly Leu Ala Arg Thr Pro Pro Met Gly Trp Leu Ala

TGG GAG CGG TTC CGC TGC AAC GTG AAC TGC CGG GAG GAC CCC CGC CAG
Trp Glu Arg Phe Arg Cys Asn Val Asn Cys Arg Glu Asp Pro Arg Gln

TGC ATC AGT GAG ATG CTC TTC ATG GAG ATG GCA GAC CGA ATA GCA GAG
Cys Ile Ser Glu Met Leu Phe Met Glu Met Ala Asp Arg Ile Ala Glu

GAC GGC TGG AGG GAG CTG GGC TA? AAG TAC ATC AAT ATC GAT GAC TGC
Asp Gly Trp Arg Glu Leu Gly Tyr Lys Tyr Ile Asn Ile Asp Asp Cys

TGG GCC GCC AAG CAG CGT GAC ACT GAG GGG CGG CTG GTG CCT GAC CCC
Trp Ala Ala Lys Gln Arg Asp Thr Glu Gly Arg Leu Val Pro Asp Pro

GAG AGG TTC CCC CGG GGC ATT AAG GCC TTG GCT GAC TAC GTT CAT GCC
Glu Arg Phe Pro Arg Gly Ile Lys Ala Leu Ala Asp Tyr Val His Ala

CGA GGC TTG AAG CTG GGC ATT TAT GGC GAC CTG GGC AGA CTC ACC TGT
Arg Gly Leu Lys Leu Gly Ile Tyr Gly Asp Leu Gly Arg Leu Thr Cys

GGA GGC TAC CCA GGC ACC ACG CTG GAC CGT GTG GAG CAG GAC GCA CAG
Gly Gly Tyr Pro Gly Thr Thr Leu Asp Arg Val Glu Gln Asp Ala Gln

ACC TTC GCT GAG TGG GGT GTG GAC ATG CTG AAG CTA GAT GGG TGC TAC
Thr Phe Ala Glu Trp Gly Val Asp Met Leu Lys Leu Asp Gly Cys Tyr

TCA TCG GGG AAG GAG CAG GCA CAG GGC TAC CCA CAA ATG GCA ??? G??
Ser Ser Gly Lys Glu Gln Ala Gln Gly Tyr Pro Gln Met Ala Arg Ala

TTG AAC GCC ACT GGC CGC CCC ATC GTC TAC TCC TGC AGC TGG ??? ???
Leu Asn Ala Thr Gly Arg Pro Ile Val Tyr Ser Cys Ser Trp Pro Ala

TAC CAG GGG GGG CTG CCT CCC AAG GTG AAC TAC ACT CTC CTG ??? ???
Tyr Gln Gly Gly Leu Pro Pro Lys Val Asn Tyr Thr Leu Leu Gly Glu

ATC TGC AAC CTG TGG CGG AAC TAC GAT GAC ATC CAG GAC TCA TGG ???
Ile Cys Asn Leu Trp Arg Asn Tyr Asp Asp Ile Gln Asp Ser Trp Asp

AGC GTG CTT TCC ATC GTG GAC TGG TTC TTC ACA AAC CAG GAT GTG CTG
Ser Val Leu Ser Ile Val Asp Trp Phe Phe Thr Asn Gln Asp Val Leu

CAG CCG TTT GCT GGC CCT GGC CAC TGG AAT GAC CCA GAC ATG CTC ATC
Gln Pro Phe Ala Gly Pro Gly His Trp Asn Asp Pro Asp Met Leu Ile

ATT GGA AAT TTC GGT CTC AGC TAT GAG CAG TCA CGT TCC CAA ATG ???
Ile Gly Asn Phe Gly Leu Ser Tyr Glu Gln Ser Arg Ser Gln Met Ala

TTG TGG ACC ATT ATG GCA GCT CCA CTC CTC ATG TCC ACC GAC CTG ???
Leu Trp Thr Ile Met Ala Ala Pro Leu Leu Met Ser Thr Asp Leu Arg

ACT ATC TCG CCG AGT GCC AAG AAG ATT CTG CAG AAC CGC CTG ATG A??
Thr Ile Ser Pro Ser Ala Lys Lys Ile Leu Gln Asn Arg Leu Met Ile

CAG ATA AAC CAG GAC CCC TTG GGA ATC CAG GGG CGC AGG ATC ??? AA?
Gln Ile Asn Gln Asp Pro Leu Gly Ile Gln Gly Arg Arg Ile Ile Lys

GAG GGA TCC CAC ATT GAG GTG TTC CTG CGC CCG CTG TCA CAG GCT GCC
Glu Gly Ser His Ile Glu Val Phe Leu Arg Pro Leu Ser Gln Ala Ala

AGT GCC CTG GTC TTC TTC AGC CGG AGG ACA GAC ATG CCC TTC CGC TAC
Ser Ala Leu Val Phe Phe Ser Arg Arg Thr Asp Met Pro Phe Arg Tyr

ACC ACC AGT CTT GCC AAG CTT GGC TTC CCC ATG GGA GCT GCA TAT GAG
Thr Thr Ser Leu Ala Lys Leu Gly Phe Pro Met Gly Ala Ala Tyr Glu

GTG CAA GAC GTG TAC AGT GGG AAG ATC ATC AGT GGC CTG AAG ACA GGA
Val Gln Asp Val Tyr Ser Gly Lys Ile Ile Ser Gly Leu Lys Thr Gly

GAC AAC TTC ACA GTG ATC ATC AAC CCC TCA GGG GTG GTG ATG TGG TAC
Asp Asn Phe Thr Val Ile Ile Asn Pro Ser Gly Val Val Met Trp Tyr

CTG TGT CCC AAA GCA CTG CTC ATC CAG CAG CAA GCT CCT GGG GGG CCC
Leu Cys Pro Lys Ala Leu Leu Ile Gln Gln Gln Ala Pro Gly Gly Pro

TCG CGC CTG CCC CTT CTG TGA GGC CCA TGA TTG GGA GCC CTG GGA TAC
Ser Arg Leu Pro Leu Leu

ATC TCA CCG CTG CTC AAG TGC CTT CTT CTG GTG TGG CTG GGG GAG GAC
ATG CAG CTT GCT CCT CTG GCA CCA CCT GAT GAT TTC TAC TCA TTC CAC
GTG AAG CAG GAC TTC TTG TTA CTC CCT CCT GAG AGC ATG CAA AGC GCT
CTG AGG TCC TCC TGT GGA AGA GGA GTG TTC CCA GTG ACC ATC CTT TAG
GAC CAG ATG TGG TCA CCT TTT TTC CTT TGC TTG GCT TAG GAC AAA GGG
CTG TCC ACA GGC TGC ACC CCT CTT CCC AGG CAC CAT CCC CAG ACC AGG
AGC TCC TGG GGC CAG GCT GTC TCT GTC TGG CAG CAG GAT CAG CAG GTA
ACA CCA CTA CAG TGT AGT CCG CAC ATA ATG AAA AAG AAA TCT AAA CAA
AAC GTG TGC CAG TAG TGT ACT GAA CCC GCT CTG GTT ACA GCA GAG CAA
AAC CTG AGT TGT CCA TGC ACA ATC CCA GTA TCC TCA CTG TGG TGT TAG
CAT GAA AAA TTG CAG TCA CAG TGC ATT GTG CAC GAG TGG TGT CTG GAA
GAT GCT GAT GCT TGT TCG TGG TGG TCT TAA GGT GGG AGA TGC TCA TGG
GTG CTG GCC AAG TTG CAT CTC AAT CTT GTG AGG CTG AAC CTT CCA GCA
TTT CTC AGG GAA AGG CTC TTC CTT TTA AAG GCA GCC TGC ACA AAT AGA
AG? GGC TCA GAA GGA CGC ACG AGG AGG GGC TCA GGT GGG CCG TGC TCC
CCT GAC CAC CCC AAG AGG GGT CAA CTA CTC ACC AAA ATC TAC CCC TTT
CAA GGC CAG GTC AGC CCA GGG AGA CGC ACC CAA GGT TAA ACC TCA AAA
CAG GAA ATC ACC CTA TTT TAA ATT AGT GAG AAA TTG AAC TTC CCC ATT
CTA TTC AGA TGA GGG CTA GAA GCC CAC TCT CCT TAG AAG GCA CGT GGT
GGA TTC CTG CCC CTT GCA GAG ACA TTG TGG TCT GAA GCA AGA TGC TGA
ATG TGA TCT TTG CAG CGC TGG AAA TGA CAT GTC TGT TTC ATG CTT GTG
TGG GAG ATG GCT TTG TTT TTG TGA TTT TGA CAA TTT AAC TGA AAT AAA
AGG GAA GCA GAG GGG

A DNA vector containing a sequence encoding chicken
liver α-N-acetylgalactosaminidase was deposited under the
Budapest Treaty with the American Type Culture Collection,
Rockville, Maryland, on March 17, 1993, tested and found viable
on March 22, 1993 and catalogued as ATCC No. 75434.

The recombinant chicken liver
α-N-acetylgalactosaminidase enzyme of this invention can be cloned
and expressed so that it is readily available for use in the
removal of A antigens from the surface of cells in blood
products. The enzyme of this invention can be cloned and
expressed by screening a chicken liver cDNA library to obtain
the cDNA sequence which encodes the chicken liver
α-N-acetylgalactosaminidase, sequencing the encoding cDNA
once it is determined, cloning the encoding cDNA and expressing
α-N-acetylgalactosaminidase from the cloned encoding cDNA.
This may be performed by obtaining an amplified human
α-N-acetylgalactosaminidase fragment capable of use as a
screening probe, screening a chicken liver cDNA library, such
as the one described hereinabove, using the amplified human
α-N-acetylgalactosaminidase fragment as a probe so as to
obtain the cDNA sequence of the chicken liver cDNA library
which encodes chicken liver α-N-acetylgalactosaminidase,
sequencing the encoding DNA, cloning the encoding DNA and
expressing chicken liver α-N-acetylgalactosaminidase enzyme
from the cloned encoding cDNA. Alternatively, screening can be
performed using antibodies which recognize chicken liver
α-N-acetylgalactosaminidase.

Methods which are well known to those skilled in the
art can be used to construct expression vectors containing the
chicken liver α-N-acetylgalactosaminidase coding sequence,
with appropriate transcriptional/translational signals for
expression of the enzyme in the corresponding expression
systems. Appropriate organisms, cell types and expression
systems include: cell-free systems such as a rabbit
reticulocyte lysate system, prokaryotic bacteria, such as
E. coli, eukaryotic cells, such as yeast, insect cells,
mammalian cells (including human hepatocytes or Chinese hamster
ovary (CHO) cells), plant cells or systems, and animal systems
including oocytes and transgenic animals.

The entire chicken liver α-N-acetylgalactosaminidase
coding sequence or functional fragments or functional
equivalents thereof may be used to construct the above
expression vectors for production of functionally active enzyme
in the corresponding expression systems. Due to the degeneracy
of the DNA code, it is anticipated that other DNA sequences
which encode substantially the same amino acid sequence may be
used. Additionally, changes to the DNA coding sequence which
alter the amino acid sequence of the chicken liver
α-N-acetylgalactosaminidase enzyme may be introduced which
result in the expression of functionally active enzyme. In
particular, amino acid substitutions may be introduced which
are based on similarity to the replaced amino acids,
particularly with regard to the charge, polarity,
hydrophobicity, hydrophilicity, and size of the side chains of
the amino acids.
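
The degeneracy point above can be illustrated with a short Python sketch. It translates the first eight codons of the coding sequence given earlier, together with an alternative DNA sequence built from synonymous codons; the alternative sequence and the helper names are hypothetical and are used only to show that different DNA sequences can encode the same amino acids under the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string into one-letter amino acid codes."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First eight codons of the coding sequence listed above.
native = "ATGCTGGAGAACGGGCTGGCGCGG"
# Hypothetical alternative built from synonymous codons.
synonymous = "ATGTTAGAAAATGGACTCGCAAGA"

print(translate(native), translate(synonymous))
```

Both strings translate to the same peptide (Met-Leu-Glu-Asn-Gly-Leu-Ala-Arg) even though they differ at seven nucleotide positions.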

Once a recombinant chicken liver
α-N-acetylgalactosaminidase enzyme is cloned and expressed,
said enzyme can be used to remove A antigens from the surface
of cells in blood products. Methods of utilizing chicken liver
α-N-acetylgalactosaminidase to remove A antigens from the
surface of erythrocytes can be found in U.S. Patent No.
4,609,627 issued September 2, 1986 to Goldstein, entitled
"Enzymatic Conversion of Certain Sub-Type A and AB
Erythrocytes", which is incorporated herein by reference.

Sub-type A antigens can be removed from the surface of
erythrocytes by contacting the erythrocytes with the
recombinant chicken liver α-N-acetylgalactosaminidase enzyme
of this invention for a period of time sufficient to remove the
A antigens from the surface of the erythrocytes.

EXAMPLE

Isolation and Characterization 
of the Chicken Liver cDNA Clone 

Chicken liver α-N-acetylgalactosaminidase was
purified to homogeneity. The enzyme was a glycoprotein with a
molecular weight of 80 kDa, and was dissociated into two
identical subunits at pH 7.5. Its optimal pH for cleavage of
the synthetic p-nitrophenyl-α-N-acetylgalactosaminylpyranoside
substrate was 3.65 and the activity dropped sharply
when the pH was raised above 7. The N-terminal sequence
obtained from the purified chicken liver
α-N-acetylgalactosaminidase showed a strong homology with the
corresponding sequence deduced from the human
α-N-acetylgalactosaminidase cDNA clone described in Tsuji et
al., and Wang et al.

In order to isolate and characterize the cDNA clone
for chicken liver α-N-acetylgalactosaminidase, two
oligonucleotides, corresponding to nucleotides 688 to 705 and
1219 to 1236 of the human α-N-acetylgalactosaminidase
sequence published by Wang et al., were synthesized. Using
human placental mRNA (Clontech) as a template, the specific
cDNA was made from the downstream (C-terminal)
oligonucleotide. Next, a DNA fragment corresponding to human
α-N-acetylgalactosaminidase residues from 688 to 1236 was
amplified from the cDNA by the hot-start PCR technique. The
PCR reaction mixture was preheated at 95°C for 5 minutes and
maintained at 80°C while Taq DNA polymerase (Promega) was added
to reduce the possible non-specific annealing at lower
temperature. 35 cycles of amplification were then carried out
as follows: 94°C for 1 minute, SCO for 2 minutes and 72°C for
3 minutes. The same conditions for PCR were applied in all of
the following experiments.
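
The figures in the paragraph above imply a probe length and a minimum run time, which the following Python sketch works out. It assumes the nucleotide positions are inclusive coordinates and it ignores ramp times between temperatures; neither point is stated in the text, and the variable names are illustrative.

```python
# Primer positions on the human sequence, taken from the text.
FORWARD_START = 688   # first nucleotide of the upstream oligonucleotide
REVERSE_END = 1236    # last nucleotide of the downstream oligonucleotide

# Expected length of the amplified probe, assuming inclusive coordinates.
amplicon_bp = REVERSE_END - FORWARD_START + 1

# Cycling programme from the text: minutes per step, repeated 35 times.
PREHEAT_MIN = 5
STEP_MINUTES = (1, 2, 3)   # denaturation, annealing, extension
CYCLES = 35

cycling_min = CYCLES * sum(STEP_MINUTES)
total_min = PREHEAT_MIN + cycling_min   # ramp and hold times not included

print(f"probe length: {amplicon_bp} bp")
print(f"programmed time: {total_min} min ({total_min / 60:.1f} h)")
```

Under these assumptions the probe is 549 bp long and the programmed steps alone take a little over three and a half hours.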
The PCR-amplified fragment was then used as a
radioactively-labeled probe in the screening of a chicken liver
cDNA library (Stratagene) based on homology hybridization. The
filters containing the library were hybridized with the probe
overnight at 42°C in a solution of 50% formamide, 5X SSPE,
5X Denhardt's, 0.1% SDS and 0.1 mg/ml salmon sperm DNA. The
filters were then washed as follows:

1. 3 X SSC + 0.1% SDS, 20 min. room temperature
2. 2 X SSC + 0.1% SDS, 20 min. room temperature
3. 1 X SSC + 0.1% SDS, 20 min. 56°C
4. 1 X SSC + 0.1% SDS, 20 min. 56°C

The filters were autoradiographed overnight at -70°C.
The positive clones were picked up for the second-round
screening following the same procedure. In total, three
consecutive screenings were carried out in order to obtain a
well-isolated positive clone.

From approximately one million plaques screened, one 
positive clone was successfully isolated. The sequencing data 
indicated that the clone consists of a 1.2 kb 3'-untranslated 
region and a 0.7 kb coding region which is highly homologous to 
human α-N-acetylgalactosaminidase. In order to obtain the 
missing coding sequence, the library was rescreened using 
the 1.9 kb cDNA clone as a probe. However, no positive clone 
was identified by this approach. 

The upstream cDNA sequence was then obtained by 
applying multiple amplification (the nested PCR technique) of a 
second chicken liver cDNA library (Clontech). Figure 1 
represents a diagram of the strategy used to clone and sequence 
the chicken liver α-N-acetylgalactosaminidase cDNA. The cDNA 
encoding chicken liver α-N-acetylgalactosaminidase contained 
a 1.2 kb coding region (slashed area) and a 1.2 kb 3' 
untranslated region. The arrows at the bottom of the diagram 
indicate the sequencing strategy. CL1, CL2 and CL3 are 
oligonucleotides used as primers for the nested PCR. CL1 and 
CL2 are located at position 924-941 nt and 736-753 nt, 
respectively (see Figure 2). According to the N-terminal 
sequence of native chicken liver enzyme, the oligonucleotide 
CL3 [5'-CTGGAGAAC(T)GGA(GC)CTGGCT(CA)CG] was designed taking 
into account chicken codon usage and "best guess". 
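
The degenerate notation used for CL3 can be unpacked with a short sketch. It assumes that a base followed by parenthesized letters means "this base or any of the alternatives in parentheses" at that single position, and it reads the printed digit 6 in the primer as G; both are assumptions about the printed text, and the function name is hypothetical.

```python
import re
from itertools import product


def expand_degenerate(primer: str) -> list:
    """Enumerate every concrete sequence encoded by a primer written as
    bases with optional parenthesized alternatives, e.g. 'AAC(T)' meaning
    'AA' followed by C or T."""
    # Each token is one base plus an optional group of alternative bases.
    tokens = re.findall(r"([ACGT])(?:\(([ACGT]+)\))?", primer)
    # A position's choices are the printed base followed by its alternatives.
    choices = [base + alternatives for base, alternatives in tokens]
    return ["".join(combo) for combo in product(*choices)]


# CL3 as printed in the text (with the digit 6 read as G, an assumption).
CL3 = "CTGGAGAAC(T)GGA(GC)CTGGCT(CA)CG"

variants = expand_degenerate(CL3)
print(len(variants), "sequences of length", len(variants[0]))
for seq in variants[:3]:
    print(seq)
```

Under this reading the primer pool contains 2 x 3 x 3 distinct 20-mers, one choice per degenerate position.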

In the first-round PCR amplification, the whole cDNA 
library was used as a template in the presence of one specific 
primer (CL1) (see Figure 1) and one universal primer derived 
from the library vector (5'-GTGGTAAT[illegible]TAGGGACC). A small 
aliquot from the above reaction was taken directly for the 
second-round amplification with a different set of primers. 
The primer CL2 had the sequence located upstream of CL1 (Figure 
1) and the second primer, CL3, was designed based on the 
N-terminal amino acid sequence from purified chicken liver 
α-N-acetylgalactosaminidase (see Figure 1). A 750 bp 
fragment was sequenced to eliminate any possible PCR 
artifacts. Since the 750 bp fragment overlapped with the 
1.9 kb clone isolated by the library screening, the two 
fragments were linked together by PCR to reconstitute the cDNA 
encoding chicken liver α-N-acetylgalactosaminidase (Figure 
1). The DNA sequencing was performed according to standard 
procedure, and the coding region was sequenced in both 
orientations. 



The Cloned DNA Encodes Chicken 
Liver α-N-Acetylgalactosaminidase 

The authenticity of the cDNA clone was established by 
co-linearity of deduced amino acid sequences with N-terminal 
and CNBr-digested peptide sequences from purified chicken liver 
α-N-acetylgalactosaminidase. Figure 2 represents the nucleic 
acid sequence and deduced amino acid sequence of the chicken 
liver α-N-acetylgalactosaminidase cDNA clone. The underlined 
regions in Figure 2 match sequences obtained from the 
N-terminus and CNBr-derived fragments of enzyme purified from 
chicken liver. The first 3 nucleotides, ATG, were added during 
subcloning to serve as the translational initiation codon for 
protein expression. The polyadenylation signal (AATAAA) at 
positions 2299-2304 nt is double-underlined. The boxed 
sequence indicates potential sites for N-glycosylation. 
According to the cDNA, the mature protein of 405 amino acids 
has a molecular mass of about 45 kDa, consistent with that of 
the purified enzyme estimated by SDS-PAGE. Due to the cloning 
approach applied, the sequence at the 5' end of the cDNA 
corresponded to the N-terminal sequence of the mature enzyme 
isolated from chicken liver. 
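
Several statements in the paragraph above can be checked against the sequence listing with a small sketch. The two DNA segments are copied from SEQ ID NO: 1 (nucleotides 1-48 and 2257-2319); the helper names are hypothetical, and the 110 Da average residue mass is a common rule of thumb used here as an assumption, not a value from the text.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) in frame 1; '*' marks a stop."""
    seq = dna.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def find_signal(segment: str, first_position: int, signal: str = "AATAAA"):
    """Return 1-based (start, end) of the first `signal` in `segment`, where
    `first_position` is the cDNA coordinate of the segment's first base."""
    seq = segment.replace(" ", "").upper()
    offset = seq.find(signal)
    if offset < 0:
        raise ValueError("signal not found")
    start = first_position + offset
    return start, start + len(signal) - 1


# Nucleotides 1-48 of SEQ ID NO: 1 (added ATG plus the mature N-terminus).
FIVE_PRIME = "ATG CTG GAG AAC GGG CTG GCG CGG ACC CCG CCC ATG GGC TGG TTG GCC"
# Nucleotides 2257-2319 of SEQ ID NO: 1 (end of the 3' untranslated region).
THREE_PRIME = ("TGG GAG ATG GCT TTG TTT TTG TGA TTT TGA CAA TTT AAC TGA AAT AAA "
               "AGG GAA GCA GAG GGG")

n_terminus = translate(FIVE_PRIME)
poly_a = find_signal(THREE_PRIME, first_position=2257)

# Rough mass of a 405-residue protein at ~110 Da per residue (assumption).
approx_mass_kda = 405 * 110 / 1000

print(n_terminus, poly_a, approx_mass_kda)
```

The translated N-terminus begins with the added methionine followed by leucine, and the rough mass estimate lands close to the 45 kDa figure quoted from SDS-PAGE.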

In order to express the chicken liver 
α-N-acetylgalactosaminidase in a rabbit reticulocyte lysate, 
the sequence from nucleotides 1 to 1260, which contained the 
coding region for chicken liver α-N-acetylgalactosaminidase, 
was subcloned into the vector PCR-II (Invitrogen) in such an 
orientation that the T7 promoter was located upstream of the 
insert. Since the N-terminus of the mature protein started 
with leucine, a translational initiation codon, ATG, was added 
during the subcloning construction. The construct was then 
used as a template in a coupled transcription-translation 
system, the TNT system (Promega), for protein expression according 
to the procedure recommended by the manufacturer. 

In order to produce the recombinant 
α-N-acetylgalactosaminidase in large quantities in bacteria 
and purify the enzyme in a single-step fashion, the cDNA was 
subcloned into the EcoRI site of the pTrcHis vector 
(Invitrogen) for expression in E. coli. Because of the 
sequence in the vector, the expressed enzyme contained a 
polyhistidine tag at its N-terminus, which permitted one-step 
purification by affinity chromatography from crude cell lysates. 

Figure 3 represents the expression of chicken liver 
α-N-acetylgalactosaminidase in bacteria and rabbit 
reticulocyte lysate as shown by Western blotting. Lane 1 
through lane 4 demonstrate the results of expression in a 
rabbit reticulocyte lysate. The expression was carried out in 
lysate in the presence of 35S-methionine with (lane 1) or 
without (lane 2) the expression plasmid. Next, 5 µl of the 
reaction sample was loaded onto a 12% SDS-PAGE gel. The gel was 
dried and autoradiographed for 2 hours and a band of an 
apparent molecular weight of about 45 kDa was visualized with 
the expression plasmid (lane 1, Figure 3). In order to confirm 
the authenticity of the expressed protein, a Western blot was 
performed using a polyclonal antibody raised against 
α-N-acetylgalactosaminidase purified from chicken liver. 
Using non-labelled methionine instead, the same expression 
reaction was performed for a Western blot (Promega) as shown in 
lanes 3 and 4, with and without the expression plasmid, 
respectively. As indicated in Figure 3, the antibody 
specifically recognized a band from the reaction with 
expression plasmid (lane 3), but not in the control (lane 4). 
Lane 5 shows the protein expressed in bacteria and recognized 
by the same antibody on Western blot. Lane 6 shows the 
α-N-acetylgalactosaminidase purified from chicken liver as a 
positive control. Molecular weight size marker (m) is 
indicated on the left. Hence, it was confirmed that the 
isolated cDNA clone codes for the chicken liver 
α-N-acetylgalactosaminidase. 

Comparison of the Cloned Chicken Liver 
Sequence with other Enzyme Sequences 

The chicken liver α-N-acetylgalactosaminidase 
sequence was compared with published sequences of other 
α-N-acetylgalactosaminidases and α-galactosidases which 
cleave α-galactose sugar groups. Figure 4 shows a homology 
comparison between various α-N-acetylgalactosaminidases and 
α-galactosidases. Alignment was carried out using both the 
computer program PROSIS (Hitachi Software Engineering Corp., 
Ltd.) and manual arrangement. The amino acid sequences were 
deduced from cDNAs. Sequences I and II are of 
α-N-acetylgalactosaminidases from chicken liver and human 
placenta, respectively. Sequences III, IV, V and VI represent 
α-galactosidase from human, yeast, Cyamopsis tetragonoloba 
and Aspergillus niger, respectively. Sequences IV and VI are 
truncated at the C-terminus, as indicated by **. Identical or 
conservatively substituted amino acid residues (five out of six 
or more) among the aligned protein sequences are boxed. The 
numbers above the sequences indicate the relative position of 
each peptide sequence. 

The deduced amino acid sequence from chicken liver 
α-N-acetylgalactosaminidase cDNA shows approximately 80% 
homology with the human α-N-acetylgalactosaminidase as 
determined by PROSIS. This homology indicates the relatedness 
of the human and chicken liver enzymes, despite the differences 
in the specific characteristics of the enzymes, particularly 
with regard to cleavage of the Forssman antigen, as has already 
been described. Also, polyclonal antibodies raised against 
chicken liver α-N-acetylgalactosaminidase enzyme do not 
cross-react with the human enzyme. The specific amino acids 
responsible for these differences remain to be elucidated. 
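
To illustrate the kind of comparison behind a figure such as "approximately 80% homology", the sketch below computes plain percent identity over one short gap-free window taken from the sequence listing: chicken residues 2-32 of SEQ ID NO: 2 against human residues 18-48 of SEQ ID NO: 3, written in one-letter code. This is not the PROSIS calculation, which also credits conservative substitutions and covers the full alignment; the window choice and the function name are assumptions made for illustration.

```python
def percent_identity(a: str, b: str) -> float:
    """Percentage of positions with identical residues in two pre-aligned,
    equal-length, gap-free sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# Chicken liver enzyme, residues 2-32 of SEQ ID NO: 2 (one-letter code).
CHICKEN_WINDOW = "LENGLARTPPMGWLAWERFRCNVNCREDPRQ"
# Human enzyme, residues 18-48 of SEQ ID NO: 3 (one-letter code).
HUMAN_WINDOW = "LDNGLLQTPPMGWLAWERFRCNINCDEDPKN"

identity = percent_identity(CHICKEN_WINDOW, HUMAN_WINDOW)
print(f"{identity:.1f}% identical over {len(CHICKEN_WINDOW)} residues")
```

A similarity score that also counts conservative replacements, such as Val for Ile or Arg for Lys, would come out higher than strict identity on the same window.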

Yamauchi et al. (1990) reported that a human 
α-N-acetylgalactosaminidase cDNA with an insertion of 70 bp at 
the position corresponding to number 376 in Figure 4 was not 
enzymatically active in a transient expression study in COS 
cells. The data suggest that the open reading frame shift 
caused by this insertion in the C-terminal portion of the 
molecule is responsible for the loss of enzymatic activity, 
indicating that amino acids in the C-terminal region may be 
essential for α-N-acetylgalactosaminidase enzyme activity. 
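
The reasoning that a 70 bp insertion shifts the reading frame follows from codons being three bases long. A minimal sketch, with hypothetical function names:

```python
def frame_offset(insert_len_bp: int) -> int:
    """Number of bases by which an insertion displaces the downstream frame."""
    return insert_len_bp % 3


def shifts_reading_frame(insert_len_bp: int) -> bool:
    """An insertion preserves the frame only if its length is a multiple of 3."""
    return frame_offset(insert_len_bp) != 0


# The 70 bp insertion discussed above: 70 = 23 codons + 1 extra base.
print(shifts_reading_frame(70), frame_offset(70))
```

Every codon downstream of such an insertion is read one base out of register, so the C-terminal residues of the protein are replaced rather than merely interrupted.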

By sequence similarity searching (BLAST) (Altschul et 
al. 1990) of available protein databases followed by sequence 
alignment using the PROSIS computer program and manual 
arrangement, it was found that α-N-acetylgalactosaminidase is 
highly homologous to α-galactosidases from human, yeast, 
Cyamopsis tetragonoloba and Aspergillus niger (ranging from 55% 
to 68% at the amino acid level). The extent of the amino acid 
sequence homology, as shown in Figure 4, suggests that these 
two functionally specific glycosidases might have evolved from 
a common ancestral gene. Considering the high degree of 
similarity and the nature of their substrates, it is possible 
that the two exoglycosidases share a similar catalytic 
mechanism and that the critical amino acid residues involved in both 
active sites are well conserved. The addition of chicken liver 
α-N-acetylgalactosaminidase cDNA to the family provides 
further insight into regions of the molecule which are 
important for substrate binding specificity and enzymatic 
activity. Given the availability of cloned enzymes from a 
number of sources, the active site and catalytic mechanisms of 
α-N-acetylgalactosaminidase and α-galactosidase enzymes may 
now be studied by means of cDNA deletion and site-directed 
mutagenesis. 

Expression of Active Chicken Liver 
α-N-Acetylgalactosaminidase in Yeast 

The first 46 nucleotides of human 
α-N-acetylgalactosaminidase cDNA (Wang et al. 1990), which 
correspond to the signal peptide sequence, were linked to the 
cloned chicken liver α-N-acetylgalactosaminidase coding 
region by PCR. The PCR-amplified product was subcloned 
directly into the vector PCR-II (Invitrogen). Two EcoRI sites 
flanking the insert were used to subclone the entire 
α-N-acetylgalactosaminidase cDNA into the yeast expression 
vector pYES2 (Invitrogen) in such an orientation that the GAL1 
promoter was located upstream of the insert. The GAL1 
promoter provides expression of the inserted cDNA clone under 
galactose-inducing growth conditions in yeast. 

The yeast vector constructs were transformed into the 
yeast strain INVSc1 (Invitrogen) using standard procedures. 
To confirm the expression of the chicken liver 
α-N-acetylgalactosaminidase in yeast, the total proteins from 
cell extract and culture supernatant were prepared and 
separated by 12% SDS-PAGE, and a Western blot was performed (under 
standard conditions) using the polyclonal antibody raised 
against purified chicken liver α-N-acetylgalactosaminidase. 
The transformed yeast cells were grown in medium without uracil 
(Bio 101, Inc.). After induction with 0.2% galactose, the cells 
were centrifuged and protein extracts were prepared using glass 
bead disruption. The secreted proteins in the culture 
supernatant were concentrated with a Centricon-30 (Amicon 
Division, W.R. Grace & Co.). The Western blot results are 
depicted in Figure 5. 

Lanes 1 and 8 of Figure 5 show the 
α-N-acetylgalactosaminidase purified from chicken liver. 
Lane 2 through lane 4 are cell extracts from the yeast 
transformed with three different pYES2 constructs: the vector 
alone (lane 2), chicken liver α-N-acetylgalactosaminidase 
cDNA coding region (lane 3), and the coding region plus signal 
sequence (lane 4). Lane 5 is the culture supernatant from 
the transformed yeast used in lane 4. Lane 7 shows the molecular 
weight standard. As shown in Figure 5, while the protein 
without signal peptide was expressed within yeast cells (lane 
3), the protein with a signal peptide sequence was 
predominantly secreted into the medium (lane 5). The larger 
molecular weight of the secreted protein observed on the 
Western blot was presumably caused by overglycosylation, as was 
observed for the expression of guar α-galactosidase in yeast 
(Fellinger et al. 1991). 

To purify the expressed α-N-acetylgalactosaminidase, 
concentrated culture supernatant was applied to an affinity 
column containing aminocaproylgalactosylamine agarose. After 
washing the column, the bound fraction was eluted with buffer 
containing 50 mM N-acetylgalactosamine. This eluate contains 
expressed α-N-acetylgalactosaminidase of similar molecular 
weight to that of the enzyme purified from chicken liver, as 
indicated in lane 6 in Figure 5. 

The expressed enzyme eluted from the column 
demonstrates activity toward the synthetic substrate 
p-nitrophenyl-α-N-acetylgalactosaminylpyranoside at pH 3.6. 
Heavily glycosylated enzyme did not bind to the affinity column 
and showed no activity against synthetic substrate. All the 
data taken together demonstrate production, secretion and 
purification of enzymatically active chicken liver 
α-N-acetylgalactosaminidase in yeast cells. 

Although the invention herein has been described with 
reference to particular embodiments, it is to be understood 
that these embodiments are merely illustrative of various 
aspects of the invention. Thus, it is to be understood that 
numerous modifications may be made in the illustrative 
embodiments and other arrangements may be devised without 
departing from the spirit and scope of the invention. 

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Alex Zhu 
               Jack Goldstein 

(ii) TITLE OF INVENTION: Recombinant α-N-Acetylgalactosaminidase 
                         Enzyme and cDNA Encoding Said Enzyme 

(iii) NUMBER OF SEQUENCES: 

(iv) CORRESPONDENCE ADDRESS: 
    (A) ADDRESSEE: Amster, Rothstein & Ebenstein 
    (B) STREET: 90 Park Avenue 
    (C) CITY: New York 
    (D) STATE: New York 
    (E) COUNTRY: U.S.A. 
    (F) ZIP: 10016 

(v) COMPUTER READABLE FORM: 
    (A) MEDIUM TYPE: 3.5 inch 1.44 Mb storage diskette 
    (B) COMPUTER: IBM PC Compatible 
    (C) OPERATING SYSTEM: MS-DOS 
    (D) SOFTWARE: Word Processor (ASCII) 

(vi) CURRENT APPLICATION DATA: 
    (A) APPLICATION NUMBER: Not yet assigned 
    (B) FILING DATE: Not yet assigned 
    (C) CLASSIFICATION: Not yet assigned 

(vii) PRIOR APPLICATION DATA: None 

(viii) ATTORNEY/AGENT INFORMATION: 
    (A) NAME: Pasqualini, Patricia A. 
    (B) REGISTRATION NUMBER: 34,894 
    (C) REFERENCE/DOCKET NUMBER: 63475/12 

(ix) TELECOMMUNICATION INFORMATION: 
    (A) TELEPHONE: (212) 697-5995 
    (B) TELEFAX: (212) 286-0854 or 286-0082 
    (C) TELEX: TWX 710-581-4766 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 2319 
    (B) TYPE: nucleic acid 
    (C) STRANDEDNESS: double 
    (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 
    (A) DESCRIPTION: cDNA to mRNA 

(iii) HYPOTHETICAL: no 

(iv) ANTI-SENSE: yes 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
    (A) ORGANISM: chicken liver 
    (B) STRAIN: 
    (C) INDIVIDUAL ISOLATE: 
    (D) DEVELOPMENTAL STAGE: 
    (E) HAPLOTYPE: 
    (F) TISSUE TYPE: 
    (G) CELL TYPE: 
    (H) CELL LINE: 
    (I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: library 

(viii) POSITION IN GENOME: unknown 
    (A) CHROMOSOME SEGMENT: 
    (B) MAP POSITION: 
    (C) UNITS: 

(ix) FEATURE: 
    (A) NAME/KEY: chicken liver α-N-acetylgalactosaminidase 
    (B) LOCATION: 
    (C) IDENTIFICATION METHOD: 
    (D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 
    (A) AUTHORS: 
    (B) TITLE: 
    (C) JOURNAL: 
    (D) VOLUME: 
    (F) PAGES: 
    (G) DATE: 
    (H) DOCUMENT NUMBER: 
    (I) FILING DATE: 
    (J) PUBLICATION DATE: 
    (K) RELEVANT RESIDUES: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ATG CTG GAG AAC GGG CTG GCG CGG ACC CCG CCC ATG GGC TGG TTG GCC 48 
Met Leu Glu Asn Gly Leu Ala Arg Thr Pro Pro Met Gly Trp Leu Ala 

TGG GAG CGG TTC CGC TGC AAC GTG AAC TGC CGG GAG GAC CCC CGC CAG 96 
Trp Glu Arg Phe Arg Cys Asn Val Asn Cys Arg Glu Asp Pro Arg Gln 

TGC ATC AGT GAG ATG CTC TTC ATG GAG ATG GCA GAC CGA ATA GCA GAG 144 
Cys Ile Ser Glu Met Leu Phe Met Glu Met Ala Asp Arg Ile Ala Glu 

GAC GGC TGG AGG GAG CTG GGC TAC AAG TAC ATC AAT ATC GAT GAC TGC 192 
Asp Gly Trp Arg Glu Leu Gly Tyr Lys Tyr Ile Asn Ile Asp Asp Cys 

TGG GCC GCC AAG CAG CGT GAC ACT GAG GGG CGG CTG GTG CCT GAC CCC 240 
Trp Ala Ala Lys Gln Arg Asp Thr Glu Gly Arg Leu Val Pro Asp Pro 

GAG AGG TTC CCC CGG GGC ATT AAG GCC TTG GCT GAC TAC GTT CAT GCC 288 
Glu Arg Phe Pro Arg Gly Ile Lys Ala Leu Ala Asp Tyr Val His Ala 

CGA GGC TTG AAG CTG GGC ATT TAT GGC GAC CTG GGC AGA CTC ACC TGT 336 
Arg Gly Leu Lys Leu Gly Ile Tyr Gly Asp Leu Gly Arg Leu Thr Cys 

GGA GGC TAC CCA GGC ACC ACG CTG GAC CGT GTG GAG CAG GAC GCA CAG 384 
Gly Gly Tyr Pro Gly Thr Thr Leu Asp Arg Val Glu Gln Asp Ala Gln 

ACC TTC GCT GAG TGG GGT GTG GAC ATG CTG AAG CTA GAT GGG TGC TAC 432 
Thr Phe Ala Glu Trp Gly Val Asp Met Leu Lys Leu Asp Gly Cys Tyr 

TCA TCG GGG AAG GAG CAG GCA CAG GGC TAC CCA CAA ATG GCA AGG GCC 480 
Ser Ser Gly Lys Glu Gln Ala Gln Gly Tyr Pro Gln Met Ala Arg Ala 

TTG AAC GCC ACT GGC CGC CCC ATC GTC TAC TCC TGC AGC TGG CCA GCC 528 
Leu Asn Ala Thr Gly Arg Pro Ile Val Tyr Ser Cys Ser Trp Pro Ala 

TAC CAG GGG GGG CTG CCT CCC AAG GTG AAC TAC ACT CTC CTG GGT GAG 576 
Tyr Gln Gly Gly Leu Pro Pro Lys Val Asn Tyr Thr Leu Leu Gly Glu 

ATC TGC AAC CTG TGG CGG AAC TAC GAT GAC ATC CAG GAC TCA TGG GAC 624 
Ile Cys Asn Leu Trp Arg Asn Tyr Asp Asp Ile Gln Asp Ser Trp Asp 

AGC GTG CTT TCC ATC GTG GAC TGG TTC TTC ACA AAC CAG GAT GTG CTG 672 
Ser Val Leu Ser Ile Val Asp Trp Phe Phe Thr Asn Gln Asp Val Leu 

CAG CCG TTT GCT GGC CCT GGC CAC TGG AAT GAC CCA GAC ATG CTC ATC 720 
Gln Pro Phe Ala Gly Pro Gly His Trp Asn Asp Pro Asp Met Leu Ile 

ATT GGA AAT TTC GGT CTC AGC TAT GAG CAG TCA CGT TCC CAA ATG GCC 768 
Ile Gly Asn Phe Gly Leu Ser Tyr Glu Gln Ser Arg Ser Gln Met Ala 

TTG TGG ACC ATT ATG GCA GCT CCA CTC CTC ATG TCC ACC GAC CTG CGC 816 
Leu Trp Thr Ile Met Ala Ala Pro Leu Leu Met Ser Thr Asp Leu Arg 

ACT ATC TCG CCG AGT GCC AAG AAG ATT CTG CAG AAC CGC CTG ATG ATC 864 
Thr Ile Ser Pro Ser Ala Lys Lys Ile Leu Gln Asn Arg Leu Met Ile 

CAG ATA AAC CAG GAC CCC TTG GGA ATC CAG GGG CGC AGG ATC ATC AAG 912 
Gln Ile Asn Gln Asp Pro Leu Gly Ile Gln Gly Arg Arg Ile Ile Lys 

GAG GGA TCC CAC ATT GAG GTG TTC CTG CGC CCG CTG TCA CAG GCT GCC 960 
Glu Gly Ser His Ile Glu Val Phe Leu Arg Pro Leu Ser Gln Ala Ala 

AGT GCC CTG GTC TTC TTC AGC CGG AGG ACA GAC ATG CCC TTC CGC TAC 1008 
Ser Ala Leu Val Phe Phe Ser Arg Arg Thr Asp Met Pro Phe Arg Tyr 

ACC ACC AGT CTT GCC [illegible] CTT [illegible] TTC CCC ATG GGA GCT GCA TAT GAG 1056 
Thr Thr Ser Leu Ala Lys Leu Gly Phe Pro Met Gly Ala Ala Tyr Glu 

GTG CAA GAC GTG TAC AGT GGG AAG ATC ATC AGT GGC CTG AAG ACA GGA 1104 
Val Gln Asp Val Tyr Ser Gly Lys Ile Ile Ser Gly Leu Lys Thr Gly 

[illegible] [illegible] TTC ACA [illegible] [illegible] [illegible] [illegible] CCC TCA GGG GTG GTG ATG TGG TAC 1152 
Asp Asn Phe Thr Val Ile Ile Asn Pro Ser Gly Val Val Met Trp Tyr 

CTG TGT CCC AAA GCA CTG CTC ATC CAG CAG CAA GCT CCT GGG GGG CCC 1200 
Leu Cys Pro Lys Ala Leu Leu Ile Gln Gln Gln Ala Pro Gly Gly Pro 

[illegible] [illegible] CTG [illegible] [illegible] [illegible] TGA GGC CCA TGA TTG GGA GCC CTG GGA TAC 1248 
Ser Arg Leu Pro Leu Leu *** 

ATC TCA CCG CTG CTC AAG TGC CTT CTT CTG GTG TGG CTG GGG GAG GAC 1296 
ATG CAG CTT GCT CCT CTG GCA CCA CCT GAT GAT TTC TAC TCA TTC CAC 1344 
GTG AAG CAG GAC TTC TTG TTA CTC CCT CCT GAG AGC ATG CAA AGC GCT 1392 
CTG AGG TCC TCC TGT GGA AGA GGA GTG TTC CCA GTG ACC ATC CTT TAG 1440 
GAC CAG ATG TGG TCA CCT TTT TTC CTT TGC TTG GCT TAG GAC AAA GGG 1488 
CTG TCC ACA GGC TGC ACC CCT CTT CCC AGG CAC CAT CCC CAG ACC AGG 1536 
AGC TCC TGG GGC CAG GCT GTC TCT GTC TGG CAG CAG GAT CAG CAG GTA 1584 
ACA CCA CTA CAG TGT AGT CCG CAC ATA ATG AAA AAG AAA TCT AAA CAA 1632 
AAC GTG TGC CAG TAG TGT ACT GAA CCC GCT CTG GTT ACA GCA GAG CAA 1680 
AAC CTG ACT TGT CCA TGC ACA ATC CCA GTA TCC TCA CTG TGG TGT TAG 1728 
CAT GAA AAA TTG CAG TCA CAG TGC ATT GTG CAC GAG TGG TGT CTG GAA 1776 
GAT GCT GAT GCT TGT TCG TGG TGG TCT TAA GGT GGG AGA TGC TCA TGG 1824 
GTG CTG GCC AAG TTG CAT CTC AAT CTT GTG AGG CTG AAC CTT CCA GCA 1872 
TTT CTC AGG GAA AGG CTC TTC CTT TTA AAG GCA GCC TGC ACA AAT AGA 1920 
AGG GGC TCA GAA GGA CGC ACG AGG AGG GGC TCA GGT GGG CCG TGC TCC 1968 
CCT GAC CAC CCC AAG AGG GGT CAA CTA CTC ACC AAA ATC TAC CCC TTT 2016 
CAA GGC CAG GTC AGC CCA GGG AGA CGC ACC CAA GGT TAA ACC TCA AAA 2064 
CAG GAA ATC ACC CTA TTT TAA ATT AGT GAG AAA TTG AAC TTC CCC ATT 2112 
CTA TTC AGA TGA GGG CTA GAA GCC CAC TCT CCT TAG AAG GCA CGT GGT 2160 
GGA TTC CTG CCC CTT GCA GAG ACA TTG TGG TCT GAA GCA AGA TGC TGA 2208 
ATG TGA TCT TTG CAG CGC TGG AAA TGA CAT GTC TGT TTC ATG CTT GTG 2256 
TGG GAG ATG GCT TTG TTT TTG TGA TTT TGA CAA TTT AAC TGA AAT AAA 2304 
AGG GAA GCA GAG GGG 2319 

(3) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 406 
    (B) TYPE: amino acid 
    (C) STRANDEDNESS: double 
    (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 
    (A) DESCRIPTION: cDNA to mRNA 

(iii) HYPOTHETICAL: no 

(iv) ANTI-SENSE: yes 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
    (A) ORGANISM: chicken liver 
    (B) STRAIN: 
    (C) INDIVIDUAL ISOLATE: 
    (D) DEVELOPMENTAL STAGE: 
    (E) HAPLOTYPE: 
    (F) TISSUE TYPE: 
    (G) CELL TYPE: 
    (H) CELL LINE: 
    (I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: library 

(viii) POSITION IN GENOME: unknown 
    (A) CHROMOSOME SEGMENT: 
    (B) MAP POSITION: 
    (C) UNITS: 

(ix) FEATURE: 
    (A) NAME/KEY: chicken liver α-N-acetylgalactosaminidase 
    (B) LOCATION: 
    (C) IDENTIFICATION METHOD: 
    (D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 
    (A) AUTHORS: 
    (B) TITLE: 
    (C) JOURNAL: 
    (D) VOLUME: 
    (F) PAGES: 
    (G) DATE: 
    (H) DOCUMENT NUMBER: 
    (I) FILING DATE: 
    (J) PUBLICATION DATE: 
    (K) RELEVANT RESIDUES: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Glu Asn Gly Leu Ala Arg Thr Pro Pro Met Gly Trp Leu Ala 16 
Trp Glu Arg Phe Arg Cys Asn Val Asn Cys Arg Glu Asp Pro Arg Gln 32 
Cys Ile Ser Glu Met Leu Phe Met Glu Met Ala Asp Arg Ile Ala Glu 48 
Asp Gly Trp Arg Glu Leu Gly Tyr Lys Tyr Ile Asn Ile Asp Asp Cys 64 
Trp Ala Ala Lys Gln Arg Asp Thr Glu Gly Arg Leu Val Pro Asp Pro 80 
Glu Arg Phe Pro Arg Gly Ile Lys Ala Leu Ala Asp Tyr Val His Ala 96 
Arg Gly Leu Lys Leu Gly Ile Tyr Gly Asp Leu Gly Arg Leu Thr Cys 112 
Gly Gly Tyr Pro Gly Thr Thr Leu Asp Arg Val Glu Gln Asp Ala Gln 128 
Thr Phe Ala Glu Trp Gly Val Asp Met Leu Lys Leu Asp Gly Cys Tyr 144 
Ser Ser Gly Lys Glu Gln Ala Gln Gly Tyr Pro Gln Met Ala Arg Ala 160 
Leu Asn Ala Thr Gly Arg Pro Ile Val Tyr Ser Cys Ser Trp Pro Ala 176 
Tyr Gln Gly Gly Leu Pro Pro Lys Val Asn Tyr Thr Leu Leu Gly Glu 192 
Ile Cys Asn Leu Trp Arg Asn Tyr Asp Asp Ile Gln Asp Ser Trp Asp 208 
Ser Val Leu Ser Ile Val Asp Trp Phe Phe Thr Asn Gln Asp Val Leu 224 
Gln Pro Phe Ala Gly Pro Gly His Trp Asn Asp Pro Asp Met Leu Ile 240 
Ile Gly Asn Phe Gly Leu Ser Tyr Glu Gln Ser Arg Ser Gln Met Ala 256 
Leu Trp Thr Ile Met Ala Ala Pro Leu Leu Met Ser Thr Asp Leu Arg 272 
Thr Ile Ser Pro Ser Ala Lys Lys Ile Leu Gln Asn Arg Leu Met Ile 288 
Gln Ile Asn Gln Asp Pro Leu Gly Ile Gln Gly Arg Arg Ile Ile Lys 304 
Glu Gly Ser His Ile Glu Val Phe Leu Arg Pro Leu Ser Gln Ala Ala 320 
Ser Ala Leu Val Phe Phe Ser Arg Arg Thr Asp Met Pro Phe Arg Tyr 336 
Thr Thr Ser Leu Ala Lys Leu Gly Phe Pro Met Gly Ala Ala Tyr Glu 352 
Val Gln Asp Val Tyr Ser Gly Lys Ile Ile Ser Gly Leu Lys Thr Gly 368 
Asp Asn Phe Thr Ile Val Ile Asn Pro Ser Gly Val Val Met Trp Tyr 384 
Leu Cys Pro Lys Ala Leu Leu Ile Gln Gln Gln Ala Pro Gly Gly Pro 400 
Ser Arg Leu Pro Leu Leu 406 

(4) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 411 
    (B) TYPE: amino acid 
    (C) STRANDEDNESS: double 
    (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 
    (A) DESCRIPTION: cDNA to mRNA 

(iii) HYPOTHETICAL: no 

(iv) ANTI-SENSE: yes 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
    (A) ORGANISM: human 
    (B) STRAIN: 
    (C) INDIVIDUAL ISOLATE: 
    (D) DEVELOPMENTAL STAGE: 
    (E) HAPLOTYPE: 
    (F) TISSUE TYPE: 
    (G) CELL TYPE: 
    (H) CELL LINE: 
    (I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: library 

(viii) POSITION IN GENOME: unknown 
    (A) CHROMOSOME SEGMENT: 
    (B) MAP POSITION: 
    (C) UNITS: 

(ix) FEATURE: 
    (A) NAME/KEY: human α-N-acetylgalactosaminidase 
    (B) LOCATION: 
    (C) IDENTIFICATION METHOD: 
    (D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 
    (A) AUTHORS: Wang et al. 
    (B) TITLE: Human α-N-Acetylgalactosaminidase: 
        Molecular Cloning, Nucleotide Sequence, 
        and Expression of a Full-Length cDNA 
    (C) JOURNAL: Journal of Biological Chemistry 
    (D) VOLUME: 265 
    (F) PAGES: 21859-21866 
    (G) DATE: 1990 
    (H) DOCUMENT NUMBER: 
    (I) FILING DATE: 
    (J) PUBLICATION DATE: 
    (K) RELEVANT RESIDUES: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Leu Leu Lys Thr Val Leu Leu Leu Gly His Val Ala Gln Val Leu 16 
Met Leu Asp Asn Gly Leu Leu Gln Thr Pro Pro Met Gly Trp Leu Ala 32 
Trp Glu Arg Phe Arg Cys Asn Ile Asn Cys Asp Glu Asp Pro Lys Asn 48 
Cys Ile Ser Glu Gln Leu Phe Met Glu Met Ala Asp Arg Met Ala Gln 64 
Asp Gly Trp Arg Asp Met Gly Tyr Thr Tyr Leu Asn Ile Asp Asp Cys 80 
Trp Ile Gly Gly Arg Asp Ala Ser Gly Arg Leu Met Pro Asp Pro Lys 96 
Arg Phe Pro His Gly Ile Pro Phe Leu Ala Asp Tyr Val His Ser Leu 112 
Gly Leu Lys Leu Gly Ile Tyr Ala Asp Met Gly Asn Phe Thr Cys Met 128 
Gly Tyr Pro Gly Thr Thr Leu Asp Lys Val Val Gln Asp Ala Gln Thr 144 
Phe Ala Glu Trp Lys Val Asp Met Leu Lys Leu Asp Gly Cys Phe Ser 160 
Thr Pro Glu Glu Arg Ala Gln Gly Tyr Pro Lys Met Ala Ala Ala Leu 176 
Asn Ala Thr Gly Arg Pro Ile Ala Phe Ser Cys Ser Trp Pro Ala Tyr 192 
Glu Gly Gly Leu Pro Pro Arg Val Asn Tyr Ser Leu Leu Ala Asp Ile 208 
Cys Asn Leu Trp Arg Asn Tyr Asp Asp Ile Gln Asp Ser Trp Trp Ser 224 
Val Leu Ser Ile Leu Asn Trp Phe Val Glu His Gln Asp Ile Leu Gln 240 
Pro Val Ala Gly Pro Gly His Trp Asn Asp Pro Asp Met Leu Leu Ile 256 
Gly Asn Phe Gly Leu Ser Leu Glu Gln Ser Arg Ala Gln Met Ala Leu 272 
Trp Thr Val Leu Ala Ala Pro Leu Leu Met Ser Thr Asp Leu Arg Thr 288 
Ile Ser Ala Gln Asn Met Asp Ile Leu Gln Asn Pro Leu Met Ile Lys 304 
Ile Asn Gln Asp Pro Leu Gly Ile Gln Gly Arg Arg Ile His Lys Glu 320 
Lys Ser Leu Ile Glu Val Tyr Met Arg Pro Leu Ser Asn Lys Ala Ser 336 
Ala Leu Val Phe Phe Ser Cys Arg Thr Asp Met Pro Tyr Arg Tyr His 352 
Ser Ser Leu Gly Gln Leu Asn Phe Thr Gly Ser Ile Val Tyr Glu Ala 368 
Gln Asp Val Tyr Ser Gly Asp Ile Ile Ser Gly Leu Arg Asp Glu Thr 384 
Asn Phe Thr Ile Val Ile Asn Pro Ser Gly Val Val Met Trp Tyr Leu 400 
Tyr Pro Ile Lys Asn Leu Glu Met Ser Gln Gln 411 

(5) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 429 
    (B) TYPE: amino acid 
    (C) STRANDEDNESS: double 
    (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 
    (A) DESCRIPTION: cDNA to mRNA 

(iii) HYPOTHETICAL: no 

(iv) ANTI-SENSE: yes 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
    (A) ORGANISM: human 
    (B) STRAIN: 
    (C) INDIVIDUAL ISOLATE: 
    (D) DEVELOPMENTAL STAGE: 
    (E) HAPLOTYPE: 
    (F) TISSUE TYPE: 
    (G) CELL TYPE: 
    (H) CELL LINE: 
    (I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: library 

(viii) POSITION IN GENOME: unknown 
    (A) CHROMOSOME SEGMENT: 
    (B) MAP POSITION: 
    (C) UNITS: 

(ix) FEATURE: 
    (A) NAME/KEY: human α-galactosidase 
    (B) LOCATION: 
    (C) IDENTIFICATION METHOD: 
    (D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 
    (A) AUTHORS: Calhoun et al. 
    (B) TITLE: Fabry Disease: Isolation of a cDNA 
        Clone Encoding Human α-Galactosidase A 
    (C) JOURNAL: Proceedings of the National Academy 
        of Sciences USA 
    (D) VOLUME: 82 
    (F) PAGES: 7364-7368 
    (G) DATE: 1985 
    (H) DOCUMENT NUMBER: 
    (I) FILING DATE: 
    (J) PUBLICATION DATE: 
    (K) RELEVANT RESIDUES: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 





Met Gln Leu Arg Asn Pro Glu Leu His Leu Gly Cys Ala Leu Ala Leu 16 
Arg Phe Leu Ala Leu Val Ser Trp Asp Ile Pro Gly Ala Arg Ala Leu 32 
Asp Asn Gly Leu Ala Arg Thr Pro Thr Met Gly Trp Leu His Trp Glu 48 
Arg Phe Met Cys Asn Leu Asp Cys Gln Glu Glu Pro [illegible] 64 
Ser Glu Lys Leu Phe Met Glu Met Ala Glu Leu Met Val Ser [illegible] 80 
Trp Lys Asp Ala Gly Tyr Glu Tyr Leu Cys Ile Asp [illegible] 96 
Ala Pro Gln Arg Asp Ser Glu Gly Arg Leu Gln Ala Asp Pro Gln Arg 112 
Phe Pro His Gly Ile Arg Gln Leu Ala Asn Tyr Val His Ser Lys Gly 128 
Leu Lys Leu Gly Ile Tyr Ala Asp Val Gly Asn Lys Thr Cys Ala Gly 144 
Phe Pro Gly Ser Phe Gly Tyr Tyr Asp Ile Asp Ala Gln Thr Phe Ala 160 
Asp Trp Gly Val Asp Leu Leu Lys Phe Asp Gly Cys Tyr Cys Asp Ser 176 
Leu Glu Asn Leu Ala Asp Gly Tyr Lys His Met Ser Leu Ala Leu Asn 192 
Arg Thr Gly Arg Ser Ile Val Tyr Ser Cys Glu Trp Pro Leu Tyr Met 208 
Trp Pro Phe Gln Lys Pro Asn Tyr Thr Glu Ile Arg Gln Tyr Cys Asn 224 
His Trp Arg Asn Phe Ala Asp Ile Asp Asp Ser Trp Lys Ser Ile Lys 240 
Ser Ile Leu Asp Trp Thr Ser Phe Asn Gln Glu Arg Ile Val Asp Val 256 
Ala Gly Pro Gly Gly Trp Asn Asp Pro Asp Met Leu Ile Val Gly Asn 272 
Phe Gly Leu Ser Trp Asn Gln Gln Val Thr Gln Met Ala Leu Trp Ala 288 
Ile Met Ala Ala Pro Leu Phe Met Ser Asn Asp Leu Arg His Ile Ser 304 
Pro Gln Ala Lys Ala Leu Leu Gln Asp Lys Asp Ile Val Ala Ile Asn 320 
Gln Asp Pro Leu Gly Lys Gln Gly Tyr Gln Leu Arg Gln Gly Asp Asn 336 
Phe Glu Val Trp Glu Arg Pro Leu Ser Gly Leu Ala Trp Ala Val Ala 352 
Met Ile Asn Arg Gln Glu Ile Gly Gly Pro Arg Ser Tyr Thr Ile Ala 368 
Val Ala Ser Leu Gly Lys Gly Val Ala Cys Asn Pro Ala Cys Phe Ile 384 
Thr Gln Leu Leu Pro Val Lys Arg Lys Leu Gly Phe Tyr Glu Trp Thr 400 
Ser Arg Leu Arg Ser His Ile Asn Pro Thr Gly Thr Val Leu Leu Gln 416 
Leu Glu Asn Thr Met Gln Met Ser Leu Lys Asp Leu Leu 429 

(6) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 438

(B) TYPE: amino acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE:

(A) DESCRIPTION: cDNA to mRNA

(iii) HYPOTHETICAL: no

(iv) ANTI-SENSE: yes

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(A) ORGANISM: yeast Saccharomyces cerevisiae

(B) STRAIN:

(C) INDIVIDUAL ISOLATE:

(D) DEVELOPMENTAL STAGE:

(E) HAPLOTYPE:

(F) TISSUE TYPE:

(G) CELL TYPE:

(H) CELL LINE:

(I) ORGANELLE:

(vii) IMMEDIATE SOURCE: library

(viii) POSITION IN GENOME: unknown

(A) CHROMOSOME SEGMENT:

(B) MAP POSITION:

(C) UNITS:

(ix) FEATURE:

(A) NAME/KEY: yeast α-galactosidase (MEL1)

(B) LOCATION:

(C) IDENTIFICATION METHOD:

(D) OTHER INFORMATION:

(x) PUBLICATION INFORMATION:

(A) AUTHORS: Liljestrom

(B) TITLE: The Nucleotide Sequence of the Yeast MEL1 Gene

(C) JOURNAL: Nucleic Acids Research

(D) VOLUME: 13

(F) PAGES: 7257-7268

(G) DATE: 1985

(H) DOCUMENT NUMBER:

(I) FILING DATE:

(J) PUBLICATION DATE:

(K) RELEVANT RESIDUES:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Met Phe Ala Phe Tyr Phe Leu Thr Ala Cys Ile Ser Leu Lys Gly Val 16

Phe Gly Val Ser Pro Ser Tyr Asn Gly Leu Gly Leu Thr Pro Gln Met 32

Gly Trp Asp Asn Trp Asn Thr Phe Ala Cys Asp Val Ser Glu Gln Leu 48

Leu Leu Asp Thr Ala Asp Arg Ile Ser Asp Leu Gly Leu Lys Asp Met 64

Gly Tyr Lys Tyr Ile Ile Leu Asp Asp Cys Trp Ser Ser Gly Arg Asp 80

Ser Asp Gly Phe Leu Val Ala Asp Glu Gln Lys Phe Pro Asn Gly Met 96

Gly His Val Ala Asp His Leu His Asn Asn Ser Phe Leu Phe Gly Met 112

Tyr Ser Ser Ala Gly Glu Tyr Thr Cys Ala Gly Tyr Pro Gly Ser Leu 128

Gly Arg Glu Glu Glu Asp Ala Gln Phe Phe Ala Asn Asn Arg Val Asp 144

Tyr Leu Lys Tyr Asp Asn Cys Tyr Asn Lys Gly Gln Phe Gly Thr Pro 160

Glu Ile Ser Tyr His Arg Tyr Lys Ala Met Ser Asp Ala Leu Asn Lys 176

Thr Gly Arg Pro Ile Phe Tyr Ser Leu Cys Asn Trp Gly Gln Asp Leu 192

Thr Phe Tyr Trp Gly Ser Gly Ile Ala Asn Ser Trp Arg Met Ser Gly 208

Asp Val Thr Ala Glu Phe Thr Arg Pro Asp Ser Arg Cys Pro Cys Asp 224

Gly Asp Glu Tyr Asp Cys Lys Tyr Ala Gly Phe His Cys Ser Ile Met 240

Asn Ile Leu Asn Lys Ala Ala Pro Met Gly Gln Asn Ala Gly Val Gly 256

Gly Trp Asn Asp Leu Asp Asn Leu Glu Val Gly Val Gly Asn Leu Thr 272

Asp Asp Glu Glu Lys Ala His Phe Ser Met Trp Ala Met Val Lys Ser 288

Pro Leu Ile Ile Gly Ala Asn Val Asn Asn Leu Lys Ala Ser Ser Tyr 304

Ser Ile Tyr Ser Gln Ala Ser Ile Val Ala Ile Asn Gln Asp Ser Asn 320

Gly Ile Pro Ala Thr Arg Val Trp Arg Tyr Tyr Val Ser Asp Thr Asp 336

Glu Tyr Gly Gln Gly Glu Ile Gln Met Trp Ser Gly Pro Leu Asp Asn 352

Gly Asp Gln Val Val Ala Leu Leu Asn Gly Gly Ser Val Ser Arg Pro 368

Met Asn Thr Thr Leu Glu Glu Ile Phe Phe Asp Ser Asn Leu Gly Ser 384

Lys Lys Leu Thr Ser Thr Trp Asp Ile Tyr Asp Leu Trp Ala Asn Arg 400

Val Asp Asn Ser Thr Ala Ser Ala Ile Leu Gly Arg Asn Lys Thr Ala 416

Thr Gly Ile Leu Tyr Asn Ala Thr Glu Gln Ser Tyr Lys Asp Gly Leu 432

Ser Lys Asn Asp Thr Arg ** 438

(7) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 411

(B) TYPE: amino acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE:

(A) DESCRIPTION: cDNA to mRNA

(iii) HYPOTHETICAL: no

(iv) ANTI-SENSE: yes

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(A) ORGANISM: guar plant Cyamopsis tetragonoloba

(B) STRAIN:

(C) INDIVIDUAL ISOLATE:

(D) DEVELOPMENTAL STAGE:

(E) HAPLOTYPE:

(F) TISSUE TYPE:

(G) CELL TYPE:

(H) CELL LINE:

(I) ORGANELLE:

(vii) IMMEDIATE SOURCE: library

(viii) POSITION IN GENOME: unknown

(A) CHROMOSOME SEGMENT:

(B) MAP POSITION:

(C) UNITS:

(ix) FEATURE:

(A) NAME/KEY: guar α-galactosidase

(B) LOCATION:

(C) IDENTIFICATION METHOD:

(D) OTHER INFORMATION:

(x) PUBLICATION INFORMATION:

(A) AUTHORS: Overbeeke et al

(B) TITLE: Cloning and Nucleotide Sequence of the α-Galactosidase cDNA From Cyamopsis tetragonoloba (guar)

(C) JOURNAL: Plant Molecular Biology

(D) VOLUME: 13

(F) PAGES: 541-550

(G) DATE: 1989

(H) DOCUMENT NUMBER:

(I) FILING DATE:

(J) PUBLICATION DATE:

(K) RELEVANT RESIDUES:







(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Ala Thr His Tyr Ser Ile Ile Gly Gly Met Ile Ile Val Val Leu 16

Leu Met Ile Ile Gly Ser Glu Gly Gly Arg Leu Leu Glu Lys Lys Asn 32

Arg Thr Ser Ala Glu Ala Glu His Tyr Asn Val Arg Arg Tyr Leu Ala 48

Glu Asn Gly Leu Gly Gln Thr Pro Pro Met Gly Trp Asn Ser Trp Asn 64

His Phe Gly Cys Asp Ile Asn Glu Asn Val Val Arg Glu Thr Ala Asp 80

Ala Met Val Ser Thr Gly Leu Ala Ala Leu Gly Tyr Gln Tyr Ile Asn 96

Leu Asp Asp Cys Trp Ala Glu Leu Asn Arg Asp Ser Glu Gly Asn Met 112

Val Pro Asn Ala Ala Ala Phe Pro Ser Gly Ile Lys Ala Leu Ala Asp 128

Tyr Val His Ser Lys Gly Leu Lys Leu Gly Val Tyr Ser Asp Ala Gly 144

Asn Gln Thr Cys Ser Lys Arg Met Pro Gly Ser Leu Gly His Glu Glu 160

Gln Asp Ala Lys Thr Phe Ala Ser Trp Gly Val Asp Tyr Leu Lys Tyr 176

Asp Asn Cys Glu Asn Leu Gly Ile Ser Val Lys Glu Arg Tyr Pro Pro 192

Met Gly Lys Ala Leu Leu Ser Ser Gly Arg Pro Ile Phe Phe Ser Met 208

Cys Glu Trp Gly Trp Glu Asp Pro Gln Ile Trp Ala Lys Ser Ile Gly 224

Asn Ser Trp Arg Thr Thr Gly Asp Ile Glu Asp Asn Trp Asn Ser Met 240

Thr Ser Ile Ala Asp Ser Asn Asp Lys Trp Ala Ser Tyr Ala Gly Pro 256

Gly Gly Trp Asn Asp Pro Asp Met Leu Glu Val Gly Asn Gly Gly Met 272

Thr Thr Glu Glu Tyr Arg Ser His Phe Ser Ile Trp Ala Leu Ala Lys 288

Ala Pro Leu Leu Val Gly Cys Asp Ile Arg Ala Met Asp Asp Thr Thr 304

His Glu Leu Ile Ser Asn Ala Glu Ile Val Ala Val Asn Gln Asp Lys 320

Leu Gly Val Gln Gly Lys Lys Val Lys Ser Thr Asn Asp Leu Glu Val 336

Trp Ala Gly Pro Leu Ser Asp Asn Lys Val Ala Val Ile Leu Trp Asn 352

Arg Ser Ser Ser Arg Ala Thr Val Thr Ala Ser Trp Ser Asp Ile Gly 368

Leu Gln Gln Gly Thr Thr Val Asp Ala Arg Asp Leu Trp Glu His Ser 384

Thr Gln Ser Leu Val Ser Gly Glu Ile Ser Ala Glu Ile Asp Ser His 400

Ala Cys Lys Met Tyr Val Leu Thr Pro Arg Ser 411

(8) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 447

(B) TYPE: amino acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE:

(A) DESCRIPTION: cDNA to mRNA

(iii) HYPOTHETICAL: no

(iv) ANTI-SENSE: yes

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Aspergillus niger

(B) STRAIN:

(C) INDIVIDUAL ISOLATE:

(D) DEVELOPMENTAL STAGE:

(E) HAPLOTYPE:

(F) TISSUE TYPE:

(G) CELL TYPE:

(H) CELL LINE:

(I) ORGANELLE:

(vii) IMMEDIATE SOURCE: library

(viii) POSITION IN GENOME: unknown

(A) CHROMOSOME SEGMENT:

(B) MAP POSITION:

(C) UNITS:

(ix) FEATURE:

(A) NAME/KEY: Aspergillus niger α-galactosidase

(B) LOCATION:

(C) IDENTIFICATION METHOD:

(D) OTHER INFORMATION:

(x) PUBLICATION INFORMATION:

(A) AUTHORS: den Herder et al

(B) TITLE: Cloning and Expression of a Member of the Aspergillus niger Gene Family Encoding α-Galactosidase

(C) JOURNAL: Molecular and General Genetics

(D) VOLUME: 233

(F) PAGES: 404-410

(G) DATE: 1992

(H) DOCUMENT NUMBER:

(I) FILING DATE:

(J) PUBLICATION DATE:

(K) RELEVANT RESIDUES:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Met Ile Gln Gly Leu Glu Ser Ile Met Asn Gln Gly Thr Lys Arg Ile 16

Leu Leu Ala Ala Thr Leu Ala Ala Thr Pro Trp Gln Val Tyr Gly Ser 32

Ile Glu Gln Pro Ser Leu Leu Pro Thr Pro Pro Met Gly Pro Asn Asn 48

Trp Ala Arg Phe Met Cys Asp Leu Asn Glu Thr Leu Phe Thr Glu Thr 64

Ala Asp Thr Met Ala Ala Asn Gly Leu Arg Asp Ala Gly Tyr Asn Arg 80

Ile Asn Leu Asp Asp Cys Trp Met Ala Tyr Gln Arg Ser Asp Asn Gly 96

Ser Leu Gln Trp Asn Thr Thr Lys Phe Pro His Gly Leu Pro Trp Leu 112

Ala Lys Tyr Val Lys Ala Lys Gly Phe His Phe Gly Ile Tyr Glu Asp 128

Ser Gly Asn Met Thr Cys Gly Gly Tyr Pro Gly Ser Tyr Asn His Glu 144

Glu Gln Asp Ala Asn Thr Phe Ala Ser Trp Gly Ile Asp Tyr Leu Lys 160

Leu Asp Gly Cys Asn Val Tyr Ala Thr Gln Gly Arg Thr Leu Glu Glu 176

Glu Tyr Lys Gln Arg Tyr Gly His Trp His Gln Val Leu Ser Lys Met 192

Gln His Pro Leu Ile Phe Ser Glu Ser Ala Pro Ala Tyr Phe Ala Gly 208

Thr Asp Asn Asn Thr Asp Trp Tyr Thr Val Met Asp Trp Val Pro Ile 224

Tyr Gly Glu Leu Ala Arg His Ser Thr Asp Ile Leu Val Tyr Ser Gly 240

Ala Gly Ser Ala Trp Asp Ser Ile Met Asn Asn Tyr Asn Tyr Asn Thr 256

Leu Leu Ala Arg Tyr Gln Arg Pro Gly Tyr Phe Asn Asp Pro Asp Phe 272

Leu Ile Pro Asp His Pro Gly Leu Thr Ala Asp Glu Lys Arg Ser His 288

Phe Ala Leu Trp Ala Ser Phe Ser [illegible] 304

Ile Pro Ala Leu Ser Lys Asp Glu Ile Ala Phe Leu Ile Asn Glu Ala 320

Leu Ile Ala Val Asn Gln Asp Pro Leu Ala Gln Gln Ala Thr Leu Ala 336

Ser Arg Asp Asp Thr Leu Asp Ile Leu Thr Arg Ser Leu Ala Asn Gly 352

Asp Arg Leu Leu Thr Val Leu Asn Lys Gly Asn Thr Thr Val Thr Arg 368

Asp Ile Pro Val Gln Trp Leu Gly Leu Thr Glu Thr Asp Cys Thr Tyr 384

Thr Ala Glu Asp Leu Trp Asp Gly Lys Thr Gln Lys Ile Ser Asp His 400

Ile Lys Ile Glu Leu Ala Ser His Ala Thr Ala Val Phe Arg Leu Ser 416

Leu Pro Gln Gly Cys Ser Ser Val Val Pro Thr Gly Leu Val Phe Asn 432

Thr Ala Ser Gly Asn Cys Leu Thr Ala Ala Ser Asn Ser Ser Val ** 447
WE CLAIM:

1. A recombinant chicken liver α-N-acetylgalactosaminidase
enzyme.

2. A recombinant chicken liver α-N-acetylgalactosaminidase
enzyme having a molecular weight of about 45 kDa.

3. A recombinant chicken liver α-N-acetylgalactosaminidase
enzyme which is immunoreactive with an antibody specific for
chicken liver α-N-acetylgalactosaminidase.

4. A recombinant chicken liver α-N-acetylgalactosaminidase
enzyme having about 80% amino acid sequence homology with human
α-N-acetylgalactosaminidase enzyme.

5. A recombinant chicken liver α-N-acetylgalactosaminidase
enzyme including the amino acid sequence depicted in Figure 2,
from amino acid number 1 to amino acid number 406.

6. A recombinant vector containing a nucleotide sequence
encoding chicken liver α-N-acetylgalactosaminidase.

7. The recombinant vector of Claim 6 which includes the
nucleotide sequence depicted in Figure 2, from nucleotide
number 1 to nucleotide number 1218.

8. The DNA vector containing a sequence which encodes
chicken liver α-N-acetylgalactosaminidase deposited on
March 17, 1993 with the American Type Culture Collection,
Rockville, Maryland, and catalogued as ATCC No. 75434.

9. A method of producing chicken liver
α-N-acetylgalactosaminidase comprising culturing a cell
transformed with a recombinant vector containing a nucleotide
sequence encoding chicken liver α-N-acetylgalactosaminidase,
and recovering chicken liver α-N-acetylgalactosaminidase from
said culture.

10. The method of Claim 9 wherein the recombinant vector
includes the nucleotide sequence depicted in Figure 2, from
nucleotide number 1 to nucleotide number 1218.

11. The method of Claim 9 which further comprises the
step of purifying active chicken liver
α-N-acetylgalactosaminidase enzyme from said culture by
affinity column.

12. The method of Claim 11 wherein the affinity 
column is aminocaproylgalactosylamine agarose. 

13. A method of removing A antigens from the surface
of erythrocytes comprising contacting said erythrocytes with
recombinant chicken liver α-N-acetylgalactosaminidase enzyme
for a period of time sufficient to remove said A antigens from
the surface of said erythrocytes.

14. The method of Claim 13 wherein the recombinant
chicken liver α-N-acetylgalactosaminidase enzyme has a
molecular weight of about 45 kDa.

15. The method of Claim 13 wherein the recombinant
chicken liver α-N-acetylgalactosaminidase enzyme is
immunoreactive with an antibody specific for chicken liver
α-N-acetylgalactosaminidase.

16. The method of Claim 13 wherein the recombinant
chicken liver α-N-acetylgalactosaminidase enzyme has about
80% amino acid sequence homology with human
α-N-acetylgalactosaminidase.

17. The method of Claim 13 wherein the recombinant
chicken liver α-N-acetylgalactosaminidase enzyme includes the
amino acid sequence depicted in Figure 2, from amino acid
number 1 to amino acid number 406.
ATG CTG GAG AAC GGG CTG GCG CGG ACC CCG CCC ATG GGC TGG TTG GCC 48
Met Leu Glu Asn Gly Leu Ala Arg Thr Pro Pro Met Gly Trp Leu Ala

TGG GAG CGG TTC CGC TGC AAC GTG AAC TGC CGG GAG GAC CCC CGC CAG 96
Trp Glu Arg Phe Arg Cys Asn Val Asn Cys Arg Glu Asp Pro Arg Gln

TGC ATC AGT GAG ATG CTC TTC ATG GAG ATG GCA GAC CGA ATA GCA GAG 144
Cys Ile Ser Glu Met Leu Phe Met Glu Met Ala Asp Arg Ile Ala Glu

GAC GGC TGG AGG GAG CTG GGC TAC AAG TAC ATC AAT ATC GAT GAC TGC 192
Asp Gly Trp Arg Glu Leu Gly Tyr Lys Tyr Ile Asn Ile Asp Asp Cys

TGG GCC GCC AAG CAG CGT GAC ACT GAG GGG CGG CTG GTG CCT GAC CCC 240
Trp Ala Ala Lys Gln Arg Asp Thr Glu Gly Arg Leu Val Pro Asp Pro

GAG AGG TTC CCC CGG GGC ATT AAG GCC TTG GCT GAC TAC GTT CAT GCC 288
Glu Arg Phe Pro Arg Gly Ile Lys Ala Leu Ala Asp Tyr Val His Ala

CGA GGC TTG AAG CTG GGC ATT TAT GGC GAC CTG GGC AGA CTC ACC TGT 336
Arg Gly Leu Lys Leu Gly Ile Tyr Gly Asp Leu Gly Arg Leu Thr Cys

GGA GGC TAC CCA GGC ACC ACG CTG GAC CGT GTG GAG CAG GAC GCA CAG 384
Gly Gly Tyr Pro Gly Thr Thr Leu Asp Arg Val Glu Gln Asp Ala Gln

[illegible] GCT GAG TGG GGT GTG GAC ATG CTG AAG CTA GAT GGG TGC TAC 432
Thr Phe Ala Glu Trp Gly Val Asp Met Leu Lys Leu Asp Gly Cys Tyr

TCA TCG GGG AAG GAG CAG GCA CAG GGC TAC CCA CAA ATG GCA AGG GCC 480
Ser Ser Gly Lys Glu Gln Ala Gln Gly Tyr Pro Gln Met Ala Arg Ala

TTG AAC GCC ACT GGC CGC CCC ATC GTC TAC TCC TGC AGC TGG CCA GCC 528
Leu Asn Ala Thr Gly Arg Pro Ile Val Tyr Ser Cys Ser Trp Pro Ala

TAC CAG GGG GGG CTG CCT CCC AAG GTG AAC TAC ACT CTC CTG GGT GAG 576
Tyr Gln Gly Gly Leu Pro Pro Lys Val Asn Tyr Thr Leu Leu Gly Glu

ATC TGC AAC CTG TGG CGG AAC TAC GAT GAC ATC CAG GAC TCA TGG GAC 624
Ile Cys Asn Leu Trp Arg Asn Tyr Asp Asp Ile Gln Asp Ser Trp Asp

AGC GTG CTT TCC ATC GTG GAC TGG TTC TTC ACA AAC CAG GAT GTG CTG 672
Ser Val Leu Ser Ile Val Asp Trp Phe Phe Thr Asn Gln Asp Val Leu

CAG CCG TTT GCT GGC CCT GGC CAC TGG AAT GAC CCA GAC ATG CTC ATC 720
Gln Pro Phe Ala Gly Pro Gly His Trp Asn Asp Pro Asp Met Leu Ile

ATT GGA AAT TTC GGT CTC AGC TAT GAG CAG TCA CGT TCC CAA ATG GCC 768
Ile Gly Asn Phe Gly Leu Ser Tyr Glu Gln Ser Arg Ser Gln Met Ala

TTG TGG ACC ATT ATG GCA GCT CCA CTC CTC ATG TCC ACC GAC CTG CGC 816
Leu Trp Thr Ile Met Ala Ala Pro Leu Leu Met Ser Thr Asp Leu Arg

ACT ATC TCG CCG AGT GCC AAG AAG ATT CTG CAG AAC CGC CTG ATG ATC 864
Thr Ile Ser Pro Ser Ala Lys Lys Ile Leu Gln Asn Arg Leu Met Ile

CAG ATA AAC CAG GAC CCC TTG GGA ATC CAG GGG CGC AGG ATC ATC AAG 912
Gln Ile Asn Gln Asp Pro Leu Gly Ile Gln Gly Arg Arg Ile Ile Lys

GAG GGA TCC CAC ATT GAG GTG TTC CTG CGC CCG CTG TCA CAG GCT GCC 960
Glu Gly Ser His Ile Glu Val Phe Leu Arg Pro Leu Ser Gln Ala Ala

AGT GCC CTG GTC TTC TTC AGC CGG AGG ACA GAC ATG CCC TTC CGC TAC 1008
Ser Ala Leu Val Phe Phe Ser Arg Arg Thr Asp Met Pro Phe Arg Tyr